Skip to content

docs: Retitle and improve the existing guides#939

Open
vdusek wants to merge 10 commits into
masterfrom
docs/improve-guides
Open

docs: Retitle and improve the existing guides#939
vdusek wants to merge 10 commits into
masterfrom
docs/improve-guides

Conversation

@vdusek

@vdusek vdusek commented Jun 5, 2026

Copy link
Copy Markdown
Contributor

Improves the existing framework guides — clearer titles, flatter example code, and a few content fixes. The id: slugs and URLs are unchanged, so existing links keep working.

  • Retitle guides 01–06 to action-oriented names (e.g. "Use Crawlee" → "Building crawlers with Crawlee").
  • Flatten the scraper examples (BeautifulSoup, Parsel, Playwright, Selenium): extract a scrape_page helper and track crawl depth via Crawlee's built-in Request.crawl_depth instead of a manual user_data counter. Each stays a single file with one runnable "Run on Apify" block.
  • Crawlee & Scrapy guides: tidy the Apify Proxy wording, and fix the Scrapy __main__.py tab label/title plus a grammar nit.
  • Web server guide: fix the Actor.configuration.web_server_url/web_server_port references (the prose used non-existent container_* attributes), add a FastAPI example and an Actor Standby section, and renumber it 07 → 12 to make room for the new guides.
  • Quick-start: refresh the guides list.

TODO before merging

@vdusek vdusek added adhoc Ad-hoc unplanned task added during the sprint. t-tooling Issues with this label are in the ownership of the tooling team. labels Jun 5, 2026
@vdusek vdusek self-assigned this Jun 5, 2026
@github-actions github-actions Bot added this to the 142nd sprint - Tooling team milestone Jun 5, 2026
@codecov

codecov Bot commented Jun 5, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 89.93%. Comparing base (0daca28) to head (e9011da).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #939      +/-   ##
==========================================
+ Coverage   89.90%   89.93%   +0.03%     
==========================================
  Files          49       49              
  Lines        3091     3091              
==========================================
+ Hits         2779     2780       +1     
+ Misses        312      311       -1     
Flag Coverage Δ
e2e 35.91% <ø> (ø)
integration 56.90% <ø> (+0.03%) ⬆️
unit 78.74% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@vdusek vdusek changed the title docs: improve guides — consistent titles and flatter scraper examples docs: Flatten scraper examples and fix guide inaccuracies Jun 5, 2026
@vdusek vdusek force-pushed the docs/improve-guides branch from 0c1be46 to 65f8e0d Compare June 5, 2026 11:24
@vdusek vdusek marked this pull request as ready for review June 5, 2026 11:27
@vdusek vdusek requested a review from szaganek as a code owner June 5, 2026 11:27
@vdusek vdusek requested a review from janbuchar June 5, 2026 11:27
@vdusek vdusek changed the title docs: Flatten scraper examples and fix guide inaccuracies docs: Retitle and streamline the existing guides Jun 5, 2026
@vdusek vdusek requested review from Mantisus and removed request for janbuchar June 5, 2026 18:56

@Mantisus Mantisus left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Only one suggestion.

Comment thread docs/03_guides/code/03_playwright.py Outdated
@vdusek vdusek changed the title docs: Retitle and streamline the existing guides docs: Retitle and improve the existing guides Jun 9, 2026
vdusek added a commit that referenced this pull request Jun 12, 2026
…921)

Reviews the SDK documentation (Introduction, Concepts, Upgrading)
against the codebase — fixes verified inaccuracies, unifies wording, and
fills a few gaps. Does not touch the Guides section; those changes live
in #939.

## Correctness fixes

- **Storages**: documented the real `Dataset.export_to`
(`content_type='csv'|'json'`); `export_to_csv`/`export_to_json` don't
exist.
- **Upgrading to v2**: `Actor.start_task` → `Actor.call_task` (accepts
`Webhook`, returns `ActorRun`).
- **Introduction**: Python requirement `3.10+` → `3.11+`.
- **CONTRIBUTING**: development Python requirement `3.10` → `3.11`.
- **PyPI metadata**: the "Release Notes" URL in `pyproject.toml` pointed
at the v2 upgrade guide; now points at the v4 guide (wheel metadata is
immutable after publish).
- **Actor events**: corrected the `SYSTEM_INFO` / `MIGRATING` payloads
and `Config` → `Configuration`, each event now links its real data type;
fixed `04_actor_events.py` (passing a Pydantic model as logging `extra=`
raises `TypeError`).
- **Configuration**: `APIFY_PERSIST_STORAGE=0` →
`APIFY_PURGE_ON_START=0`.
- **Actor lifecycle**: fixed an empty `<ApiLink>`.
- **Quick-start**: corrected the `__main__.py` description and tab
label.

## Consistency & additions

- Consistent capitalization ("Apify Proxy", "Apify Console", "Actor");
assorted grammar and upgrade-guide heading fixes; made an absolute
self-link relative.
- New **Storage clients** concept page with two runnable snippets
(default `SmartApifyStorageClient`, single vs. shared request queue,
`force_cloud`, custom client via `service_locator`); re-exported
`StorageClient` from `apify.storage_clients`.
- Added a `## Conclusion` to every concept page, expanded the
Introduction feature list, and added a Configuration options table.

## TODO before merging

- [ ] Mirror these changes into `website/versioned_docs/version-3.4/` so
the published docs version reflects them, not only under "next": apply
the edits to the existing Introduction, Concepts, and Upgrading pages,
and clone the new Storage clients page
(`docs/02_concepts/12_storage_clients.mdx` + its `code/12_*.py`
snippets).
# Conflicts:
#	docs/03_guides/07_running_webserver.mdx
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

adhoc Ad-hoc unplanned task added during the sprint. t-tooling Issues with this label are in the ownership of the tooling team.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants